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ABSTRACT 

We present a Bayesian approach to modelling galaxy clusters using multi-frequency pointed 
observations from telescopes that e xploit the Sunyaev-Ze rdovich effect. We use the recently 
developed MultiNest technique dFeroz & Hobsonl20 08; Feroz, Hobson & Bridges 2008) to 
explore the high-dimensional parameter spaces and also to calculate the Bayesian evidence. 
This permits robust parameter estimation as well as model comparison. Tests on simulated 
Arcminute Microkelvin Imager observations of a cluster, in the presence of primary CMB 
signal, radio point sources (detected as well as an unresolved background) and receiver noise, 
show that our algorithm is able to analyse jointly the data from six frequency channels, sample 
the posterior space of the model and calculate the Bayesian evidence very efficiently on a 
single processor. We also illustrate the robustness of our detection process by applying it to a 
field with radio sources and primordial CMB but no cluster, and show that indeed no cluster 
is identified. The extension of our methodology to the detection and modelling of multiple 
clusters in multi-frequency SZ survey data will be described in a future work. 

Key words: methods: data analysis - methods: statistical - cosmology:observations - galax- 
ies: clusters: general - cosmic microwave background 



1 INTRODUCTION 

Clusters of galaxies are the most massive gravitationally bound ob- 
jects in the Universe and as such are critical tracers of the formation 
of large-scale structure. The size and formation history of massive 
clusters is such that the ratio of cluster gas mass to total mass is ex- 
pected to be representative of the universal ratio fib/fim, once the 
relatively small amount of baryonic matter in the cluster galaxies is 
taken into account (see e.g. lWhite et al. 1993). The comoving num- 
ber density of clusters as a function of mass and redshift is expected 
to be particu larly sensitive to the co smological parameters erg and 
flm (see e.g. iBatt ve & Weller 20 031). This has been predicted both 
analytically (see e.g. IPress & Schechterl 1 19741 ; ISheth et al.ll200lh 
and from large-scale nu merical simulations (see e.g. ljenkins et al.l 
l200ll : lEvrard et al.l2002h . but cluster number densities have not yet 
been measured at redshifts z > 1, because of the basic problem of 
the dimming of surface brightness with redshift. Moreover, optical 
observations are confused by foreground galaxies, and both optical 
and X-ray observations are biased towards strong mass concentra- 
tions such as clumps. 

The Suny aev-Zel'dovich (SZ, Sunvaev & Zeldovich 1970, 

19721 : see e.g. iBirkinshawl 1 19991 and ICarlstrom. Holder & R eese 
20021 for reviews) effect produces secondary anisotropies in 



the cosmic microwave background (CMB) radiation through 
inverse-Compton scattering from the electrons in the hot in- 
tracluster gas (which radiates via thermal Bremsstrahlung in 
the X-ray waveband), and the subsequent transfer of some 
of the energy of the electrons to the low-energy pho- 
tons. Pointed SZ observation s of clusters have been rou- 
tine for some time (see e.g. iBirkinshaw. Gull & Moffetl Il98ll: 



| Birkinshaw, Gull & Hardebeckl Il984l: lUsonl Il98fj, Uones et al 
1993 1) . IGrainge et al.l Il996l. iGrainge et al.l l2002kb, ICotter et~ 



Grainger et al.l 2002 , ISaunders et alj 120031, Jones et al 



2002a,b. 

20051 ICarlstrom, Jov & Gregdll99dlGrego et al.ll2000l.lPatel et al 
2000. iReeseet all |2000|. |jov et all |200lL iReese et al.l \20oX 



Bonamente et alj |2006| . iLaRoque et al.l 120061) . By fitting simple 
parametric cluster models to the observed data set, one would like 
to make model-dependent inferences about the cluster parameters, 
i.e. to calculate the probability distribution of these parameters. We 
also wish to compare different cluster models to enhance our astro- 
physical understanding. These tasks are most conveniently carried 
out through Bayesian inference. 
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iMarshall et alj d2003l) presented a Bayesian approach for the 
joint analysis of pointed SZ and weak lensing data. They used 
a highly effective but computationally intensive Markov Chain 
Monte Carlo sampler to explore the high-dimensional param- 
eter space and employed the thermodynamic integration tech- 
nique to calculate the Bayesian evidence. We have since ex- 
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tended this approach to incorporate the hig hly efficient parameter- 
space sampling method, MultiNest dFeroz & HobsonI 1200 it 
Feroz et all [2008). allowing us to analyse efficiently cluster ob- 
servations in the presence of multiple radio sources. The analy- 
sis can now be done on multi-frequency SZ data. The new gen- 
eration of SZ instruments including A CT |kosowskv 2006) , AM I 
jAMI Consortium: Zwart et alj 120081) . AMiBA jLi et alj l2006h . 
APEX-SZ jDobbs et alj|2006h . CARMA (www . mmarray . org) 
and SPT dRuhl et alj |2004y, all have multiple frequency chan- 
nels. Our algorithm includes a more sophisticated model for the 
radio sources by allowing for their spectral indices to be non- 
flat. We also take into account the noise contribution coming 
from the population of faint unsubtracted radio sources. Further- 
more, the Bayesian evidence is now handled comprehensively and 
use d for objective and quantita tive detection of clusters (follow- 
ing lHobson & Mc Lachlan 2003) as well as for model selection be- 
tween different cluster models. In this paper we restrict our focus 
to the analysis of pointed SZ observations, but our basic method- 
ology can be extended to the detection and modelling of multiple 
clusters in multi-frequency SZ survey data, and will be described 
in a forthcoming publication. 

In Section|2]we briefly describe the AMI telescope. In Section 
[3]we give an introduction to the Bayesian inference. Sections|4]and 
[5] describe the SZ effect and our analysis methodology including 
those features described above. In Section [6] we apply our algo- 
rithm to simulated SZ cluster data and we present our conclusions 
in Section|7] 



2 THE ARCMINUTE MICROKELVIN IMAGER 

Although the cluster modelling method we present in this paper 
is quite general in nature, a principal goal of this work has been 
to develop an efficient and robust technique for analysing clus- 
ter SZ observations made b y the Arcminute Microkelvin Image r 
(AMI, iKneissl et al] 1200 ll. [AMI Consortium: Zwart et alj l2008h . 
This instrument consists of a pair of interferometer arrays op- 
erating currently with six frequency channels spanning 13.9- 
18.2 GHz for observations on angular scales of 30"-10'. The tele- 
scope is aimed principally at SZ imag ing of clusters of galaxies 
(AMI Consortium: Grainge e t al. 2006). In order to couple to the 
extended SZ flux from cluster gas structures, the Small Array (SA) 
has a large filling factor and thus excellent temperature sensitivity. 
The dominant contaminant in SZ observations at these Rayleigh- 
leans frequencies are the radio sources (see also Section [53}. The 
Large Array (LA) deliberately has higher resolution and better flux 
sensitivity than the SA and, observing simultaneously with the SA 
both in time and frequency, allows subtraction of such radio point 
sources. SA and LA parameters are summarized in Table[T] 



3 BAYESIAN INFERENCE 

Our cluster modelling methodology is built upon the principles of 
Bayesian inference; we now give a summary of this framework. 
Bayesian inference methods provide a consistent approach to the 
estimation of a set of parameters © in a model (or hypothesis) H 
for the dataD. Bayes' theorem states that 



Table 1. AMI technical summary. 





SA 


LA 


Antenna diameter 


3.7 m 


12.8 m 


Number of antennas 


10 


8 


Baseline lengths (current) 


5-20 m 18-110 m 


Primary beam (15.7 GHz) 


20'. 1 


5'.5 


Synthesized beam 




«30" 


Flux sensitivity 


30 mJy s~ 


1 ' 2 SmJys-Va 


Observing frequency 


13 


9-18.2 GHz 


Bandwidth 




4.3 GHz 


Number of channels 




6 


Channel bandwidth 




0.72 GHz 



where Px{@\D, H) = P(&) is the posterior probability distri- 
bution of the parameters, Pr(D|@, H) = C(&) is the likeli- 
hood, Pi(&\H) = 7r(@) is the prior probability distribution, and 
Pr(D\H) = Z is the Bayesian evidence. 

In parameter estimation, the normalising evidence factor is 
usually ignored, since it is independent of the parameters 0, and 
inferences are obtained by taking samples from the (unnormalised) 
posterior using standard MCMC sampling methods, where at equi- 
librium the chain contains a set of samples from the parameter 
space distributed according to the posterior. This posterior consti- 
tutes the complete Bayesian inference of the parameter values, and 
can, for example, be marginalised over each parameter to obtain 
individual parameter constraints. 

In contrast to parameter estimation, for model selection the 
evidence takes the central role and is simply the factor required to 
normalize the posterior over 0: 



Z 



£(©)7r(0)d D 0, 



(2) 



where D is the dimensionality of the parameter space. As the av- 
erage of the likelihood over the prior, the evidence is larger for 
a model if more of its parameter space is likely and smaller for a 
model with large areas in its parameter space having low likelihood 
values, even if the likelihood function is very highly peaked. Thus, 
the evidence automatically implements Occam's razor: a simpler 
theory with compact parameter space will have a larger evidence 
than a more complicated one, unless the latter is significantly bet- 
ter at explaining the data. The question of model selection between 
two models Ho and Hi can then be decided by comparing their re- 
spective posterior probabilities, given the observed data set D, via 
the model selection ratio R: 

Primp) Pr(0|#i)Pr(ffi) Z 1 Pt{H 1 ) 



R 



(3) 



Pr(&\D,H) 



Pr(D\ &,H)Pr{&\H) 
Pr(D\H) ' 



(1) 



Pr(H \D) Pr{D\H )Pr(H ) Z Pv{H y 

where Pr(Hi)/ Pr(Ho) is the a priori probability ratio for the two 
models, which can often be set to unity but occasionally requires 
further consideration. 

Various alternative inform ation c r iteria for astrophysical 
model selection are discussed by Liddle (2007), but the evidence 
remains the preferred method. However, evaluation of the multidi- 
mensional integral in l[2j is a challenging numerical task. Standard 
techniques like thermodynamic integration are extremely compu- 
tationally expensive, which makes evidence evaluation at least an 
order-of-magnitude more costly than parameter estimation. Some 
fast approximate methods have been used for evidence evaluation, 
such as treating th e posterior as a mu ltivariate Gaussian centred 
at its peak (see e.g. IHobson et alj|2002h . but this approximation is 
clearly a poor one for multimodal posteriors (except perhaps if one 
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performs a separate Gaussian approximation at each mod e). The 
Savag e-Dickey density ratio has also been proposed (see e.g. lTrott3 
2007) as an exact, and potentially faster, means of evaluating evi- 
dences, but is restricted to the special case of nested hypotheses and 

a separable prior on the model parameters. 

The nested sampling approach, introduced bv lSkillindl2004h . 
is a Monte-Carlo method targeted at the efficient calculation of the 
evidence, but also prod uces posterior inferenc es as a by-product. 
iFeroz & Ho bson ( 2008f) and lFeroz et alj d2008h built on this nested 
sampling framework, and have recently introduced the MULTI- 
NEST algorithm which is very efficient in sampling from posteriors 
that may contain multiple modes and/or large (curving) degenera- 
cies, and also calculates the evidence. This technique has greatly 
reduced the computational cost of Bayesian parameter estimation 
and model selection, and is employed in this paper. 



4 SUNYAEV-ZEL'DOVICH EFFECT FROM CLUSTERS 

The primary anisotropics in the CMB are roughly one part in 10 5 
and reflect the intrinsic non-uniformity of the matter and radia- 
tion just before the Universe cooled sufficiently to form transparent 
atomic gas. The secondary anisotropies in the CMB are due to pro- 
cesses affecting the CMB after its emission. Much the most relevant 
for this work is the thermal SZ effect. 

The gas temperature in galaxy clusters is 10 7 -10 8 K. With 
the radius r of the cluster, the electron number density n e , the op- 
tical depth for Thomson scattering through the centre of the clus- 
ter is r ~ 2rn c <TT ~ 10~ , where <tt the Thomson scatter- 
ing cross-section. On average, energy is transferred from the elec- 
trons to the scattered CMB photons, with the fractional energy in- 
crease approximately equal to feT '/m c c 2 ~ 10~ 2 with fee the 
Boltzmann constant, T and m c the electron mass and tempera- 
ture respectively and c the speed of light. These factors combine to 
give fractional C MB temperature flu ctuations of order 10 -4 . A full 
treatment (see e.g. B irkins hawl 1 9991) yields the following modifica- 
tion to the CMB surface brightness in the direction of the reservoir 
of electrons: 

SI V = f(u)yB u (TauB), (4) 

where B v (7/cmb ) is the blackbody spectrum at Tcmb = 2.726K, 
while the frequency-dependent function f(y) in the limit of non- 
relativistic plasma is given as 

x [srcoth(a:/2) - 4] 



/(") = 



1 



where 



(5) 



(6) 



with h p the Planck constant. At frequencies below 217 GHz, the 
SZ effect is observed as a flux decrement. The 'Comptonisation' 
parameter y is the integral of the gas pressure along the line of 
sight I through the cluster: 



V = 



m e c 2 



n c &b T dl. 



(7) 



The integral y of this Comptonisation parameter over a cluster's 
solid angle dft = dA/Dg, with Dg the angular-diameter distance 
to the cluster, is proportional to the total thermal energy content of 
the cluster: 



y 



y d£l - 



(7 T 

m c c 2 



rickBT dl dtt tx 



1J Z 



n c TdV, (8) 



where dV = dAdl is an element of comoving volume. The in- 
tegrated SZ surface brightness simply depends on the total cluster 
mass M, since, 



D 2 



D 2 
u 6 



D 2 



(9) 



where we have assumed that the cluster gas mass is proportional to 
its total mass, i.e. M g = f s M. 

One final property of the SZ effect is particularly significant: 
for a given cluster, the SZ surface brightness is independent of red- 
shift. The redshift only enters via the angular-diameter distance 
Dg(z), which at intermediate redshifts (0.5 < z < 6) is only 
weakly dependent on redshift for a concordance cosmology. There- 
fore, an SZ survey is expected to find all high-redshift clusters 
above some mass threshold with little dependence on the redshift. 



5 MODELLING INTERFEROMETRIC SZ DATA 

The majority of SZ observations to date have been made 
with interferometers (see section Q}. These instruments 
have a number of advantages o ver single-dish telescopes 
(AMI Consortium : Zwart et al. I 120081 , and references therein), 
including their r e lative insensitivity to atmospheric emission 
dLav & Halversonl l2000h . lack of required receiver stability, 
and t he ease with whic h systematic errors such as ground 
spill (Watson et al. I l2003h and radio-source contamination (see 
e.g. lGrainger et alj2002h can be minimised. 

Assuming a small field size, an interferometer operating at a 
single frequency v measures samples from the complex visibility 
plane I v (u). This is given by the weighted Fourier transform of the 
surface brightness I v (x), 



/„(«) 



A v {x)I u {x) exp(27ra< ■ x) d x, 



(10) 



where x is the position relative to the phase centre, A v (x) is the 
(power) primary beam of the antennas at the observing frequency 
v (normalised to unity at its peak), and u is a baseline vector in units 
of wavelength. Interferometers effectively measure spatial struc- 
tures in the surface brightness 8I V as the large-angular-scale 'DC 
level at the centre of the ui;-plane is never sampled. The positions 
in the ™-plane at which the function /„(«) is sampled are deter- 
mined by the physical positions of the antennas and the direction of 
the field on the sky. The samples Ui lie on a series of curves, called 
itu-tracks. 

In our model, the we assume the measured interferometer 
visibilities contain contributions from the cluster SZ signal, ra- 
dio point sources, primordial CMB anisotropies and instrumental 
noise; these contributions are discussed below. In short, however, 
each interferometer visibility is considered to consist of a signal 
and generalised noise: 



V v {Ui) = h(ui) + N„(m), 



(11) 



where the signal part contains the contributions from the SZ cluster 
and identified radio point sources, and the generalised noise part 
contains the contributions from the unresolved background of ra- 
dio point sources, primordial CMB anisotropies and instrumental 
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5.1 Cluster model 

To determine the contribution to the visibility data from the cluster 
SZ signal, one needs to calculate the Comptonisation parameter of 
the cluster given in ifTJ, for which one must assume a cluster ge- 
ometry as well as temperature and pressure profiles for the cluster 
gas. 

For the cluster geometry, spherically-symmetric models are 
a reasonable first approximation. Ellipticit y can be ad ded easily 
through a coordinate transformation (see Marshall 2003), but we 
will not pursue that here. 

The simplest gas temperature model assumes a single temper- 
ature which is in good agreeme nt with low-resolution X-ray emis- 
sion data (see e.g. lSarazinll 19881) . One could alternatively assume a 
polytropic temperature with 



p g oc pi => T oc pi 



(12) 



with 7 being the polytropic index, and p g and p g the gas pressure 
and density respectively. The polytropic model has been found to 
provi de a good fit to simulated clusters (see e.g. lKomatsu & Seliald 
1200 lb . However, in this work we take the cluster gas to be isother- 
mal. 

The cluster gas density is of t en mo delled with a /3-model 
dCavaliere & Fusco-Femianq 1 19761 , Il978l) . This has the density 
profile 



Pg( r ) 



c(0) 



[1 + (r/r c ) 2 ] : 



(13) 



where r c is the core radius at which the profile turns over into a re- 
gion of approximately constant density, while the outer logarithmic 
slope of the profile is 3/3. 

For simplicity, we will also assume here that the cluster gas 
is in hydrostatic pressure equilibrium with the total gravitational 
potential $ of the cluster, which will be dominated by the dark 
matter. Assuming spherical symmetry, the gravitational potential $ 
must thus satisfy 



dr 



dp 
dr 



(14) 



Assuming the cluster gas to be an ideal gas with temperature T, 
and its distribution to be spherically symmetric, J 14t becomes 



rflog Ps 
d log r 



GM{r)/i 
k-e.Tr 



(15) 



where p is th e mass per particle, a pproximately 0.6 times the pro- 
ton mass (see Marshal l et al.l2003h . G is the universal gravitational 
constant and M{r) is the total mass internal to radius r. We note 
that an alternative approach w ould be to assume a dark matter pro- 
file, such as the NFW profile dNavarro et al 1 1997b . and use the as- 
sumption of hydrostat ic equilibrium to det ermine the correspond- 
ing gas density profile ( Mar shall et al.l2003T) , but we will not pursue 
that here. 

Taking together the assumptions of hydrostatic equilibrium, 
spherical symmetry, a beta profile for the gas density and an isother- 
mal temperature profile, d 1 3 fc and M5\ lead to 



M(r) 



3(3r 3 k B T 
"| + r 2 \xG 



(16) 



This can be used to calculate the total cluster mass out to r2oo, 
the radius inside which the average total density is 200 times the 



critical density p cr it, as 

4.7T 3 

M 2 00 = -g- r 20o(200p C rit 

3/3r| 00 k B T 



(17) 



For the sake of brevity, we shall refer to M200 simply as the total 
mass of the cluster, and write it as M. 

Although the beta model d 1 3 fc for the gas density profile is 
expressed in terms of p g (0), the more interesting parameter is the 
total gas mass M s (r) inside a certain radius r. This can easily be 
accommodated by instead treating M s (r 200) = M s as a model 
parameter and recovering p g (0) by numerically integrating the gas 
density profile to a given radius and subsequently normalizing the 
gas mass within this radius. For this work, a radius of 1 h^ 1 Mpc 
was used as the limit of integration for the gas density profile. 

Thus, the parameters of our cluster model are taken to be T, 
M g , P and r c , along with the position (x c ,y c ) of its centroid on the 
sky and its redshift z. For a given set of values for these parameters, 
a map of the predicted Comptonisation parameter can be calculated 
by evaluating the line-of-sight integral of the projected gas density 
profile: 



y(s) = 



nck&T dl, 



(18) 



where s = 6De is the projected radius, such that r 2 = s 2 + I 2 . 
Since 

p g = n c p c , (19) 

where fx e = 1.14m p dMason & Mvers1l200d : ljones et alJI 1993b is 
the gas mass per electron, d!8t becomes 



y(s) oc 



:(r)Tdl, 



(20) 



where rum defines the radial limit of the integration. We set rn m to 
20ft _1 Mpc which is sufficiently large even for low values of f3. 

It only remains to specify the prior 7r(@ c ) on our cluster 
model parameters © c = (x c , y c ,P, r c , T, M g , z). Pointed SZ ob- 
servations will typically be directed towards (putative) clusters al- 
ready observed in the X-ray or optical bands. Ide ally, one would 
perfo rm a joint analysis of these data sets (see e.g. iMarshall et al.l 
12003b to constrain cluster parameters. For our present purpose of 
analysing SZ data alone, however, one can consider observations 
in other wavebands simply to provide (joint) priors on cluster pa- 
rameters in the analysis of the subsequent SZ data. 

For simplicity, we will assume throughout this paper that the 
prior is separable, except in the parameters M g and z, such that 



tt(0 c ) = 7r(a: c )7r(y c )7r(/3)7r(r c )7r(r)7r(M g , 2 ). 



(21) 



We use Gaussian priors on cluster position parameters, centered 
on the pointing centre and with standard deviation 1 arcmin. We 
adopt uniform priors n(r c ) = W(0, 1000)ft~ 1 kpc and ir((3) = 
W(0.3, 1.5) on the cluster core radius and j3, the outer logarithmic 
slope of the gas density profile. A Gaussian prior on cluster temper- 
ature can be adopted with the mean and standard deviation coming 
from X-ray or optical (from a velocity dispersion) measurements. 
In the absence of any X-ray or optical observation of the cluster, we 
use a uniform prior n(T) = U (0, 20) keV on the temperature. Sim- 
ilarly, optical observations could, in principle, provide a joint prior 
on the mass and redshift of the cluster. In the absence of such ob- 
servations, one can instead assign the prior based upon an assumed 
cluster mass function, as we now discuss. 
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Since it requires little further work, let us consider the very 
general case in which the cluster gas fraction f s is also unknown, 
but we have some prior vr(/ g ) on its value, perhaps from previous 
observations (we will assume for simplicity that f s is independent 
of redshift, although it is easy to relax this condition also). The gas 
mass of a cluster is related to its 'total mass' M (= M200) by 
M g = f g M and hence it is straightforward to show that 

7r(M g , z) = f 7r(/ g )7r(M = M g // g , z) (22) 
Jo Is 

where n(M, z) is the joint prior on the total cluster mass and red- 
shift. In turn, we take the latter to be equal to some assumed cluster 
mass function, d 2 n/dM dz, appropriately normalised over some 
ranges Af min < M ^ M max and z min < z ^ z max . It is worth 
noting that, in the special case where the cluster gas fraction is 
known, one simply has tv(M s , z) oc n(M = M g // g , z). 

One possibility for d 2 n/dMdz is the Press-Schechter 
dPress & Schechtedll974l) mass function. Numerical simulations 
have shown that the Press-Schechter mass function overestimates 
the abunda nce of high-mass clusters and underestimates those of 
low mass dSheth et alJbOQlh . b ut overall it still pro vides an ade- 
quate fit to TV-body simulations djenkins et al]|200ll) . In particular, 
a reasonable fit is obtained for the Press-Schecter mass function 
with erg = 0.8, which is plotted in FigureQ] along with some sam- 
ples drawn from it for illustration. 

Another possibility is simply to assume a separable prior in 
M and z, namely n(M, z) — n(M)ir(z), where each factor has 
some simple functional form such that their product gives a rea- 
sonable approximation to the Press-Schechter mass function. We 
shall assume such a form in our analysis of simulated SZ data in 
Section [6] where tt(M) will be taken to be uniform in logM in 
the range M min = 10 13 ft _1 MQ to M max = 5 x 10 15 h~ 1 M Q , 
and zero outside this range. Moreover, in Section [6] we will also 
assume simply that the cluster redshift and gas fraction are known, 
which is equivalent to imposing delta function priors on z and / g . 



"o - 




redshift z 



Figure 1. The Press-Schechter mass function with as = 0.8, together with 
some samples drawn from it for illustration. The contours enclose 68%, 
95% and 99% of the probability. 



5.2 Resolved radio point-sources model 

A key issue for SZ cluster observations is the effect of contaminat- 
ing radio sources. The most obvious problem is that the emission 
from source(s) coincident with the cluster centre can mask the SZ 
decrement. A more subtle but equally disastrous problem is that 
emission from a radio source lying on a negative sidelobe of the 
interferometer response (centered at the cluster) can mask an SZ 
decrement. AMI's observational approach to this problem is robust: 
with the RT and VSA in the past, and with AMI LA at present, we 
provide enough telescope sensitivity at high angular resolution to 
measure the flux densities at positions of the contaminating sources 
down to a faint limiting flux density Su m . We then handle these 
sources as follows. 

The visibility due to each of these radio sources can be calcu- 
lated as: 

Jl{u) = J A v {x)Su{x)exp{2-KiM ■ x) d 2 x, (23) 

where S v (x) is the source flux at a position x relative to the phase 
centre. Most of these foreground radio sources appear as 'point' 
sources in the data, in which case the visibility can be calculated 
analytically: 

= j A v (x)S v 8(x — x s ) exp(2niu ■ x) d 2 x 

= SuAvixJe**, (24) 

where x s is the position of the radio point source and (j> — 2nu ■ x s 
is the phase angle. If the radio source is extended, then a Gaussian 
profile can be used as a simple model for its flux distribution and 
023b can again be solved analytically. For this work, however, we 
assume all the radio sources to be point sources. 

For SZ observations made at more than one frequency, the 
change of source flux with frequency v also needs to be taken into 
account. Here we assume a power-law dependence with spectral 
index a such that 

S v = So , (25) 

where vq is some reference frequency (usually that of the lowest 
observed frequency channel). 

For each identified radio point source, we impose the same pri- 
ors on its parameters S = (x s , y s ,So,a). We assume throughout 
that the prior is separable: 

tt(© s ) = n{x s )n{y s )n{S )n{a). (26) 

For the source position parameters, we use a delta function prior 
centered on the measured position that is assumed known from 
higher-resolution observations with the AMI LA. Although, such 
observations also yield a source flux measurement that can be used 
to give a very narrow prior on So, for the analysis in this paper, we 
use a uniform prior W(0, 20) mjy to allow our algorithm to deter- 
mine how accurately the source fluxes can be fitted. 

For the spectral index a, a little more care is required in 
the choice of prior. Although radio source spectra typically fall 
with an index of 0.7, they differ widely because of synchrotron 
self-absorption and ageing effects, and so can vary between ~ 2 
(steeply falling), through to » (flat) and even ~ —1 (rising). 
Flat- and rising-spectrum sources tend to be more variable in time, 
and we can straightforwardly account for this b y increasing a 
source 's flux uncertainty for flat or rising sources. IWaldram et al.l 
(2007) found 15-22 GHz spectral indices for a sample of 110 9C 
(Waldr am et al .120031) sources. The modal spectral index was found 
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to be 0.5. We use the distribution (Figure|2j of these spectral indices 
as the prior n(a), binned onto a grid with Aa = 0.1. We expect 
a source's spectral index to be well-constrained by any available 
long-baseline data, such as that from the LA, and so inform the 
prior. 

Finally, since numerous radio sources are typically identified 
in each field, we combine the parameters S for each source in 
to a single radio-sources parameter set = (© s i, @ S 2, • ■ •) and 
we assume the priors on the parameters of different sources are 
separable. 



5.3 Generalised noise model and likelihood function 

As mentioned previously, there are three components that con- 
tribute to the generalised 'noise' on the visibilities dl It : instrumen- 
tal noise, primordial CMB anisotropics and confusion noise from 
the background of unsubtracted radio sources. These contributions 
are discussed bel ow and define the likelihoo d function for the data. 

Following lHobson & Maisinger ( 2002), it is convenient first to 
place the N v i s . v observed complex visibilities V v (ui) in frequency 
channel v into a real data vector d v with components 

f Re{K(«i)} (i < AW) 

d,,i = { 

{ Im{K(«i-jv vis J} (AW + 1 < i < 2AW)- 

(27) 

It will also be useful to define the total data vector d — {d v }, 
which is the concatenation of the individual data vectors for each 
frequency channel v. Similarly, one may define the real noise vec- 
tors n v containing only the noise parts N v (ui) of the complex vis- 
ibilities in dl lb , and the concatenated noise vector n. 

The instrumental noise and primordial CMB anisotropies are 
well described by Gaussian processes. The background of unre- 
solved radio sources is, however, a Poisson process. Nonetheless, 
in the limit of a large number of unresolved sources, this contribu- 
tion can also be well approximated as Gaussian (see below). Thus, 
in this paper, we shall assume a Gaussian form of the likelihood 




-10 12 3 



a 

Figure 2. Spectral index distribution from Wal dram et alj l2007t) used as 
the prior on a, normalized to unit probability at maximum. The final nor- 
malisation depends on the adopted prior range. 



function on the full parameter set © = (© c , 

£(0) = ^exp(-i X 2 ), (28) 

where \ 2 is the standard statistic quantifying the misfit between the 
observed data d and the predicted data d p (&), 

in which C v y = (n v n^,) is the generalised noise covariance 
matrix relating the frequency channels v and v' . The normalisation 
factor Zn in d28t is given by 

Z N = (2tt) (2JV ™ )/2 |C| 1/2 , (30) 

where C = (nn T ) and Avis is the total number of visibilities 
in all the frequency channels. In this work, Zn is independent of 
the model parameters © and hence it can be safely ignored in the 
analysis. 

In d29t , the predicted data df, at each frequency are a function 
of the model parameters © = (® c , For a given set of pa- 
rameter values, the predicted data are calculated as follows. First, 
the values of the cluster parameters ® c are used to calculate the 
predicted Comptonisation map using l |20| l, which in turn gives the 
cluster surface brightness 5I V through © and its weighted (by the 
primary beam) Fourier transform d lOt , calculated on a fine grid in 
the uu-plane. This is then sampled at the measured ttw-postions ttj. 
Finally, the contributions from the identified radio point sources are 
added to the predicted visibilities directly using d24t . 

The covariance matrices C v y in d29l l describe the generalised 
noise on the observed visibilities resulting from instrumental (re- 
ceiver) noise, primordial CMB anisotropies and the background of 
unresolved radio point sources. Since, in this paper, we are assum- 
ing a background cosmology, the covariance matrices C„y are not 
functions of the model parameters © and hence need only be cal- 
culated once (similarly, the Cholesky decomposition of these ma- 
trices, required for the calculation of % 2 , need only be performed 
once). 

Assuming the three contributions to the generalised noise are 
independent, the covariance matrices can be written as 

The first term on the right hand side is a diagonal matrix with 
elements crj iSijS vv / , where ov,j the rms lohnson noise on the 
ith element of the data vector d v at frequency v. The second 
term contains significant off-diagonal elements both between vis- 
ibility positions and between frequencies, and can be calculated 
from a given primary CMB power spectrum Cf MB {v) following 
iHobson & Maisinged ( 2002); note that in intensity units the CMB 
power spectrum is a function of frequency. The third term in J3 1 b is 
the covariance matrix of the source confusion noise, which allows 
for the remaining unresolved radio sources with flux densities less 
than some flux limit Sn m th at have been lef t after high-resolution 
observation and subtraction. IScheuen 1 < fl957b was the first to show 
that the flux from such unknown sources could be considered statis- 
tically as a Poissonian contr ibution (Gauss ian, in the limit of large 
numbers of sources; see e.g. ICondon|[l974l) to the measured signal. 
The response of an interferometer to a radio source has a positive 
signal and both positive and negative sidelobes. Scheuer's work is 
easily modified to the case where all sources with fluxes greater 
than a limiting flux 5n m (perhaps from another telescope) have al- 
ready been subtracted. Assuming the unresolved radio sources to 
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be randomly distributed on the sky leads to a flat angular power 
spectrum for confusion noise, given by (in intensity units) 

rS Ua 



/nconl / \ 



Sn„(S)dS. 



(32) 



where Su m is the completeness limit (at, say, 5<r) of the source 
subtraction survey, and n v (S) = diV„(> S)/dS is the differential 
source count at frequency v as a function of flux S. In principle, 
one should take into account that the unresolved radio point sources 
are not randomly distributed on the sky, and may be concentrated 
within clusters. This would lead to a power spectrum that was a 
function of I, but we do not pursue this further here. In any case, 
the power spectrum Cf {v) can be used to construct the corre- 
sponding confusion noise cova riance matrices in the same way as 
for the CMB contribution (see Hobson & Maisinger 2002 for de- 
tails). 

In our analysis of simulated AMI data in Section [6] we as- 
sume that the differential number count n v (S) does not vary with 
frequency over the AMI band 13.9-18.2 GHz. The deepest source 
counts at fr equencies above ~ 5 GHz are those at 15 GHz from the 
9C survey ( Wal dram et aiT2 003). for which 

n(S) = 51S" 2 ' 15 Jy _1 sr-\ (33) 

based on 465 sources above a 5-<r completeness of 25 mjy. We use 
this count in our analysis. 

5.4 Estimation of model parameters 

The posterior -P(@) <x L(®)n(®) of the model parameters 
= (©c, can be efficiently and robustly explored using the 
poster ior weighted samples produced by the M ultiNest algo- 
rithm jFeroz & Hobsonll2008| - iFeroz et al]|2008h . From these sam- 
ples, one can, for example, construct one-dimensional marginalised 
posterior distributions for each parameter, from which best-fit 
values and uncertainties are trivially obtained. In terms of clus- 
ter modelling, one is interested only in the cluster parameters 
© c = (x c ,y c , f3,r c ,T, M g , z), whereas the parameters VE 1 asso- 
ciated with the resolved radio point sources are considered as nui- 
sance parameters and are marginalised over. It may, however, also 
be of interest instead to marginalised over the cluster parameters 
and produce one-dimensional marginals for the flux So of each re- 
solved radio source, as well as its spectral index a. 

5.5 Quantification of cluster detection 

Owing primarily to the presence of primary CMB anisotropics, it 
is extremely important to quantify SZ cluster detection. We now 
discuss how one may calculate the probability that the observed 
field does indeed contain a real cluster above some particular mass 
limit of interest. 

This quantification is most naturally performed via a Bayesian 
model selection by evaluating the evidence associated with 
the posterior for competi ng models for the data (see e.g. 
iHobson & McLachlanll20Q3l) . It is convenient to consider the fol- 
lowing models (or hypotheses): 



Hq = 'a cluster with M R 



< M g ^ M g H m is centred in S", 



Hi = 'a cluster with M g ,n m < M g < M gimax is centred in 5", 

where S is the total prior region in the spatial subspace x c = 
(x c , y c ). Here M gjm i n is the lower limit of our assumed prior range 
on the cluster gas mass; hence clusters below this minimum mass 
are supposed not to exist. Similarly, M g , max is the upper limit of 



our assumed prior range. Finally, M gi n m is the limiting gas mass 
of interest that we discuss in more detail below. 

We must calculate the model selection ratio R given in {3]> 
between the hypotheses Ho and Hi. For each hypothesis Hi (i = 
0, 1), the evidence is given by 



Zi = 



j C(®)in(®)d®, 



(34) 



where 



7Ti(0) = 7r i (x c )7r i ( ) 3)7ri(r c )7r j (T)7r i (Mg,«)7r j (*), (35) 

for i — 0,1, are priors that define the hypotheses. In particu- 
lar, the priors on all the cluster parameters and source parameters, 
apart from M g and z, may be taken to be the same as those dis- 
cussed above for both hypotheses. Differences between the priors 
for the two hypotheses do occur in TVi(M g , z), but in a straight- 
forward manner. For hypothesis Ho, we use 7r(M g ,z) given in 
( 122b . but now appropriately normalised over the range M giJn i n < 
M g ^ M g! n m , and the prior is zero outside this range. Similarly, 
for hypothesis Hi , we use i22\ appropriately normalised over the 
range M gj ii ffl < M g < M gjmax , and the prior is zero outside this 
range. The evidences < |341 > for i = 0,1 are easily obtained using the 
MultiNest algorithm. 

So far we have not addressed the prior ratio Pr(-Hi)/ Pr(_ffo) 
in l[3}. This is easily obtained from the prior distribution 7r(M g , z) 
in {22}, and is given by 



)dM dz 



Pr(ffi) _ Jjrr Jm^z x *( m * 

Pr(-ffo) ~ / A " E ' llm 7r(M g , z) dM dz 



(36) 



It is worth noting that, in the case where the cluster gas fraction / g 
is assumed known, the above prior ratio is simply 



Pr(ffi) _ IzZ7 [m 



ax//g 
J ft 



■ dM dz 



d 2 n 



nin//g dMdz 



dM dz 



(37) 



where d 2 n/dMdz is the assumed cluster mass function, i.e. the 
distribution of the projected number density of clusters in a given 
mass and redshift bin per unit area. Moreover, if it is also assumed 
that the cluster redshift is known to be z — z c , then l !37t reduces to 



Pr(ffi) 
Pr(ffo) 



M g 



ax//g 
J ft 



dn I 

dM \z—z 



dM 



lim/ /g dn I 
„i„//g dM I 2 = 2 



dM 



(38) 



We are thus able to calculate the model selection ratio R in 
l[3}, which gives us the relative probability that the field contains a 
'true' cluster, with gas mass above the limit M g ,ii m , as opposed to 
'false' cluster, with gas mass below this limit. This, in turn, allows 
us to calculate the probability that the field contains a 'true' cluster, 
which is given by 



P : 



R 



R 



(39) 



6 APPLICATION TO SIMULATED SZ OBSERVATION 

In this section we describe the results of our SZ cluster modelling 
algorithm on simulated SZ cluster data-sets from the Arcminute 
Microkelvin Imager (AMI). 
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Figure 3. Maps made from channel 4 (14.992 GHz) of the simulations considered in this work with Simulation 'A' in the left panel and Simulation 'B' in the 
right panel. The unCLEANed maps are 512 X 512 pixels at 15" resolution. Both simulations have the same realisation of primary CMB and instrumental 
noise. The three radio sources are also the same between the two simulations. Simulation A' has a spherically-symmetric, isothermal /9-model cluster at the 
centre of the map. Simulation 'B' has no cluster. 



6.1 Simulated AMI data-sets 

In simulating mock skies and observing the m with a model AMI 
SA, w e ha ve used the methods outlined in lHobson"& Maisinger 
d2002h and iGrainge et ail |2002). We consider two simulations. 
Simulation A' has a cluster at z — 0.3 modelled as a spherically- 
symmetric isothermal /3-profile with r c = 60", f3 = 0.65, n a — 
10 -2 cm -3 and T — 8 keV. The gas profile is linearly tapered to 
zero between 20r c and 20.01r c . The Comptonisation y-parameter 
for this model is evaluated on a cube whose face has 512 x 512 
pixels at 30" resolution before being integrated along the line of 
sight. Radio point sources are added to these maps using the fluxes, 
positions and spectral indices given in Table [2] 

The uw-positions of visibility points are simulated by 
calculating the baselines (assumin g SA antenna positions in 
lAMI Consortium: Zwart et alj|2008l) for a target at right ascension 
q = 4 hours and declination 8 = +40° observed over hour angle 
±4 hours with one-second sampling. 

For each simulation, a realisation of the primary CMB is cal- 
culated using a power spectrum o f primary anisotro pies was gener- 
ated for I < 8000 using CAMB dlxwis et alj2000h . with a ACDM 
cosmology (f2 m = 0.3, J1a = 0.7, as = 0.8 and h = 0.7) as- 
sumed. Primary CMB modes on £ scales outside the range mea- 
surable (£ » 500-8000, considering the most extreme frequency 
channels) by the SA are set to zero. The CMB realisation is co- 
added to the cluster and radio source map in brightness tempera- 
ture. To each model sky we also add a population of faint, confus- 



Aic/arcsec 


Ay/arcsec 


Sls/mJy 


a 


1 8 


10 


5 





2 


-5 


15 


+1 (falling) 


3 -3 


g 


g 


-0.3 (rising) 



Table 2. Contaminating radio sources for the simulations considered in this 
work. Source positions are given in arcminute offsets from the pointing 
centre. The flux and spectral index are at 15.0 GHz. 



ing radio point sources, uniformly distributed on the sky but drawn 
from a Poisson distribution in flux, with the 9C source count (see 
section [531 > between 10 fiiy and Su m = 200 fiiy. The map is scaled 
by the primary beam appropriate to the measured value in that fre- 
quency channel and transformed into the Fourier plane (equivalent 
to Fourier transforming and convolving with the aperture illumi- 
nation function). The resulting function is sampled at the required 
visibility points and thermal receiver noise, appropriate to the mea- 
sured sensitivity of the SA, is added at this stage. 

The whole process above is repeated for each of the six fre- 
quency channels. Simulation 'B' is identical to Simulation 'A', but 
has no cluster. Maps made from the simulated visibilities for chan- 
nel 4 (14.992 GHz), for both models, are shown in Figure[3] 

6.2 Analysis and results 

We analysed the cluster simulations discussed above assuming a 
cluster model with spherical geometry, a beta profile for the gas 
and isothermal temperature. The priors used are listed for conve- 
nience in Table [3] Positions of the radio point sources were fixed 
to their true values. The primordial CMB and confusion noise were 
included through the covariance matrix as discussed in Section [573l 
For the confusion noise we used the 9C source count with limiting 
flux Sum ~ 200/iJy (correctly) and the count cut off below 10 /iJy. 



Parameters 


Priors 




(0±60)" 


M g 


12 < log 10 M e /h- 2 M@ < 14.5 


T 


< T/keV < 20 


r c 


< rc/h' 1 kpc < 1000 


P 


0.3 < P < 1.5 


So 


< 5/mJy < 20 



Table 3. Priors for the cluster and source parameters. Inequalities denote 
uniform prior probability between the given limits, whilst (a ± ft) denotes 
a Gaussian prior with mean a and variance ft 2 . 
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Figure 4. 2-D marginalized posterior probability distributions for the parameters of the cluster Simulation 'A' discussed in Section [6] The true parameter 
values used in the simulation are shown by crosses and vertical lines in 2-D and 1-D marginalisations respectively. 



For the primordial CMB anisotropies, we (correctly) assumed a 
ACDM cosmology with fi m = 0.3, Qa = 0.7, <t 8 = 0.8 and 
h — 0.7. We also assumed that the redshift of the cluster z = 0.3 
and the gas fraction f s = 0.1 were known. We analysed all six 
AMI frequencies channels jointly. In quantifying our cluster detec- 
tion, we adopted a gas mass limit of M gl i im = 1O 13 /i _1 M and 
assumed a Press-Schechter mass function. 

For the Simulation 'B' (with radio point sources and no clus- 
ter) MultiNest did identify a dominant peak in the posterior dis- 
tribution of cluster parameters but the probability odds ratio 7?, as 
discussed in Section l531 was evaluated to be 0.32 ± 0.03, showing 
that it is more than twice as likely that the field did not contain a 
'true' cluster with a gas mass above the mass limit of interest. Since 
there is no cluster in the field, the highest likelihood point comes 
from a large negative primordial feature, but since the statistics of 
the primordial CMB have been incorporated in the likelihood eval- 
uation through the covariance matrix, the Bayesian model selection 
takes this into account and consequently the odds ratio is in favour 



of the detected feature being 'false'. To verify this assertion, we 
analysed Simulation 'B' without including the CMB component in 
the covariance matrix, in which case the probability odds ratio for 
cluster detection R was evaluated to be « 150 which clearly shows 
that including the CMB is extremely important for properly mod- 
elling galaxy clusters through the SZ effect. 

For the Simulation 'A' (with cluster and radio point sources), 
the probability odds ratio for cluster detection R was evaluated 
to be e 12 ' 2±0 - 2 ; showing an overwhelming evidence in favour of 
a 'true' cluster detection. We plot the 2-D and 1-D marginalized 
posterior distributions of the cluster parameters along with the true 
parameter values used in the simulation in Figures[4]and[5]respec- 
tively. In Figure [5] we also plot the prior distributions imposed on 
the parameters. The inferred cluster parameter means and l-cr un- 
certainties are listed in Table [4] It is clear from this table and the 
posterior plots that all the model parameters have been estimated 
to reasonable accuracy. It can also be seen that the posterior for 
the cluster temperature is largely unconstrained and the expected 
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Figure 5. Priors and 1 -D marginalized posterior probability distributions for the parameters of the simulated cluster and radio point sources discussed in 
Section[6] The true parameter values used in the simulation are shown by vertical lines. 



degeneracy between T and Mg is also evident. Clearly some addi- 
tional information on cluster T is required to get a sensible estimate 
for M g . This information can come from the X-ray observation of 
the same cluster or from an optical velocity dispersion measure- 
ment. T calculated in such a way can be used as a prior for the 
analysis of the cluster. The result of a temperature measurement of 
(8 ± 2) keV applied as a Gaussian prior on T is shown in Figure[6] 
and|7] 

It is also of interest to investigate the importance of modelling 
the three radio point sources, with properties given in Table[2] that 
were added to Simulation 'A'. Assuming known source positions 
and allowing the source fluxes to vary, results in a massive increase 
in evidence over that for an analysis (incorrectly) assuming no radio 
point sources. 



P(D\3i 



P(D|No sources) 



(40) 



We list the inferred cluster parameters for the analysis (incorrectly) 
assuming no radio point sources in Table [5] Comparing the param- 
eter values in Table [5] with the values used in the simulation, it is 



Parameters 


Inferred values 


Xc/arcsec 


12 ± 11 


y c /arcsec 


3 ± 10 


r c /h~ x kpc 


276 ± 177 


P 


0.8 ±0.3 


M e /h~ 2 M & 


(1.5 ±0.5) x 10 14 


T/keV 


10 ±5 



Table 4. Inferred cluster parameters values for the analysis with cluster Sim- 
ulation 'A' discussed in Section|6] All noise sources discussed in Section l531 
were included in the covariance matrix and the observed radio sources were 
treated as nuisance parameters. The probability odds ratio for cluster detec- 
tion R was evaluated to be e 12 - 2 ±0- 2 . 




Figure 6. 2-D marginalized posterior probability distributions for T and 
M g for the analysis of the cluster Simulation 'A' discussed in Section \§\ 
The true parameter values used in the simulation are shown by crosses. 
Uniform W(0, 20) keV and Gaussian (8 ± 2) keV priors on T were used 
for the figures on the left and right hand-panels respectively. 



■ U(0, 20) keV 



T = (8 ± 2) keV 



A/ g /lO 14 /i- 2 M 
Figure 7. 1-D marginalized posterior probability distributions for Mg for 
the analysis of the cluster Simulation 'A' discussed in Section [6] The true 
Mg used in the simulation is shown by the vertical line. 
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clear that the model used in the analysis is incorrect. The algorithm 
is trying to fit for the cluster assuming no radio point sources in 
the field, but the presence of radio point sources is forcing the al- 
gorithm to compensate by preferring larger cluster r c and /3. This 
highlights the importance of radio point souce information while 
analyzing the SZ data. 

Finally, to investigate the importance of including the noise 
contribution due to unsubtracted radio sources, we analysed Sim- 
ulation 'A' again but this time without including the contribution 
from the confusion noise to the covariance matrix. The priors on the 
cluster and radio source parameters used were the same as listed in 
Table[3] This resulted in the probability odds ratio for cluster detec- 
tion R of e 50 - 8±0 ' 2 5 again showing a very strong evidence in favour 
of a true cluster detection. We list the inferred cluster parameters 
for this analysis in Table|6] It can be seen from this table that while 
most of the cluster parameters have been inferred to reasonable ac- 
curacy, the inferred cluster position is 5<r away from the true centre. 



Parameters 


Inferred values 


i' c /arcsec 


13 ± 10 


j/ c /arcsec 


40 ±8 


r c /h~ 1 kpc 


296 ± 175 


& 


0.8 ±0.3 


M g /h- 2 M Q 


(1.6 ±0.5) x 10 14 


T/keV 


10 ±5 



Table 6. Inferred cluster parameters values for the analysis with cluster 
simulation 'A' discussed in Section [6] and (incorrectly) assuming no un- 
subtracted radio sources below Sii m . 



there are several degeneracies in cluster parameters when modelled 
through the SZ effect only, the most significant of which are be- 
tween r c and /3, and between T and M s . Better constraints on 
these parameters can be obtained by using information from dif- 
ferent waveband observations. 



7 CONCLUSIONS 

Extracting and parametrizing clusters in SZ data is an extremely 
challenging task due to the presence of (both observed and 
unsubtracted) contaminating radio sources and primary CMB 
anisotropies. We have described an efficient approach using the 
MultiNest algorithm to model galaxy clusters in multi-frequency 
pointed SZ observations, in the presence of radio point sources. 
The parameters of the radio sources are treated as nuisance param- 
eters, which allows for the fitting of these parameters simultane- 
ously with the cluster parameters and consequently for the uncer- 
tainties in the measurements of the radio source parameters to be 
propagated to the cluster parameter inferences. We considered the 
three main sources of noise for SZ observations: (a) receiver noise, 
(b) primary CMB anisotropies and (c) confusion noise due to un- 
subtracted radio sources. We have also shown that it is extremely 
important to take into account all these noise contributions in the 
analysis to get the correct posterior distributions for the cluster pa- 
rameters. Even with this extra complexity, we are able to analyse 
a pointed SZ cluster observation with six frequency channels on a 
single Intel Woodcrest 3.0-GHz processor in about five hours. The 
work presented in this paper is limited to pointed cluster obser- 
vations. We plan to extend this methodology to multi-field survey 
observations in a future study. The code is fully parallel, making 
our algorithm a viable option for even the deepest SZ surveys. 

Our analysis methodology is easily extendable to do a joint 
analysis of clusters using SZ, lensing and X-ray data-sets. This data 
fusion is extremely important for understanding cluster physics, as 



Parameters 


Inferred values 


:r c /arcsec 


14 ±10 


y c /arcsec 


24 ±8 


r c /h~ 1 kpc 


546 ± 65 


P 


1.4 ±0.1 


MJh~ 2 M & 


(2.1 ±0.8) x 10 14 


T/keV 


11 ± 5 



Table 5. Inferred cluster parameters values for the analysis with cluster Sim- 
ulation 'A' discussed in Section |6]and (incorrectly) assuming no observed 
radio point sources. 
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